arXiv:1507.07727vl [cs.DS] 28Jul2015 


Prime Factorization of the Kirchhoff Polynomial: 
Compact Enumeration of Arborescences 

Matiis Mihalak 1 , Przemyslaw Uznariski 2 , and Pencho Yordanov 3,4 

department of Knowledge Engineering, Maastricht University, The Netherlands 
2 Helsinki Institute for Information Technology (HUT), Aalto University, Finland 
'department of Biosystems Science and Engineering, ETH Zurich, Switzerland 
4 Swiss Institute of Bioinformatics, Basel, Switzerland 

July 29, 2015 


Abstract 

We study the problem of enumerating all rooted directed spanning trees (arborescences) of a directed 
graph (digraph) G = (V, E) of n vertices. An arborescence A consisting of edges ei,... , e n -i can be 
represented as a monomial ei • ea • • • e n -i in variables e € E. All arborescences arb(G) of a digraph then 
define the Kirchhoff polynomial JO 4 gar b(G) ELgA e - We show how to compute a compact representation 
of the Kirchhoff polynomial its prime factorization, and how it relates to combinatorial properties 
of digraphs such as strong connectivity and vertex domination. In particular, we provide digraph 
decomposition rules that correspond to factorization steps of the polynomial, and also give necessary 
and sufficient primality conditions of the resulting factors expressed by connectivity properties of the 
corresponding decomposed components. Thereby, we obtain a linear time algorithm for decomposing a 
digraph into components corresponding to factors of the initial polynomial, and a guarantee that no finer 
factorization is possible. The decomposition serves as a starting point for a recursive deletion-contraction 
algorithm, and also as a preprocessing phase for iterative enumeration algorithms. Both approaches 
produce a compressed output and retain some structural properties in the resulting polynomial. This 
proves advantageous in practical applications such as calculating steady states on digraphs governed by 
Laplacian dynamics, or computing the greatest common divisor of Kirchhoff polynomials. Finally, we 
initiate the study of a class of digraphs which allow for a practical enumeration of arborescences. Using 
our decomposition rules we observe that various digraphs from real-world applications fall into this class 
or are structurally similar to it. 


1 Introduction and Related Work 

A spanning tree of an undirected graph G is a connected acyclic subgraph containing all vertices of G. In a 
directed graph (digraph for short) G, the analogue is an arborescence, i.e., a subdigraph of G spanning its 
vertex set such that all vertices are reachable from a root vertex along a unique directed path. 

We are interested in the problem of enumerating all arborescences of a digraph G. The famous Tutte’s 
Matrix-Tree Theorem 18] states that the number of arborescences rooted at vertex j in a digraph is equal, 
up to a sign, to the (jj )th minor of the Kirchhoff matrix; the Kirchhoff matrix, which is closely related to 
the Laplacian matrix, is the difference between the diagonal matrix having the in-degree of the vertices on 
its diagonal and the adjacency matrix of the digraph. (This theorem is itself a generalization of the equally 
famous Kirchhoff’s Theorem 9 for undirected graphs.) Thus, counting the number of arborescences can be 
achieved in polynomial time (e.g., by the Gaussian elimination). Arborescences can also be enumerated with 
the Matrix-Tree Theorem by uniquely labeling the edges of the digraph. Then the modified Kirchhoff matrix 
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Figure 1: Left: A digraph with one trivial (the initial vertex) and four non-trivial strongly connected 
components (here cycles), each induced by the edges {aq, xi, £ 3 }, x £ {a,b,c,d}. The gray area depicts one 
such strongly connected component G\V']. Right: An illustration of G\V'\. 


is the difference between the diagonal matrix having the sum of labels of all in-coming edges to each vertex 
on its diagonal and the adjacency matrix constructed from the labels of edges connecting adjacent vertices. 
Summing up all (jj)th minors results in a homogeneous polynomial, called the Kirchhoff polynomial, in which 
each monomial represents an arborescence consisting of the edges corresponding to the labels appearing in 
the monomial. Formally, the Kirchhoff polynomial k(G) of a digraph G over variables e £ E, is defined as 


k(G) = II e ’ 

Aearb(G) eeA 

where arb(G) denotes the set of all arborescences of G. In general, there might be exponentially many 
arborescences for a given G, and their enumeration by explicitly computing the canonical form of re(G) cannot 
be done in polynomial time. 

Gabow and Myers presented an algorithm for enumerating all arborescences with 0(\E\ + N ■ |K|) running 
time (N is the number of arborescences) and 0(\E\) space requirements [5]. Later, this algorithm has been 
improved by the currently two state-of-the-art algorithms |8 19 . Both algorithms start by computing an 
initial (arbitrary) arborescence, and then, iteratively, compute “close-by” arborescences by outputting only 
the edge-difference to the previously computed and listed arborescence. Uno (19 uses a reverse search, while 
Kapoor and Ramesh [ 8 | use depth-first search in the space of all arborescences (represented as an undirected 
graph where a node corresponds to an arborescence, and an edge denotes a single edge-swap between the two 
adjacent arborescences). The algorithm of Uno runs in 0{\E\ + N ■ y/\V\ log(|K|/|U|)) time and has space 
complexity 0(|K|), and the algorithm of Kapoor and Ramesh runs in 0(IVjVj + |U| 3 ) time and has a space 
complexity 0 (|U| 2 ). 

In this paper we propose an alternative approach to the above algorithms, focusing on the form of the 
output, which in our case is predominantly more compressed, and allows for certain symbolic manipulations. 
Our enumeration algorithm is based on a (recursive) factorization of the Kirchhoff polynomial k(G). We 
present two decomposition rules for digraphs, and show that every factor of k(G) corresponds to a digraph 
obtained by applying one of the decomposition rules and is derived from some subdigraph of G. The first 
decomposition rule corresponds to finding a strongly connected component (SCC) of G together with all of 
its in-coming edges ej n , and subsequently contracting all source vertices of ei n into a single new vertex. The 
second decomposition rule is more involved and is based on dominance and immediate dominance of vertices 
(these terms are explained in Section [ 2 ]). Our decomposition is closest to the decomposition that has been 
formulated in the language of principal partition of matroids 12 j. 

To the best of our knowledge, the question of relating combinatorial features of a digraph to the algebraic 
properties of its Kirchhoff polynomial (such as its prime factorisation), although being a fundamental one, 
has not previously been investigated. 

We illustrate the compressing potential of our method on an example in Figure [T] Obviously, every 
arborescence of the example is rooted in r, contains the edge with label o, and contains the edges of type 
Xi and X 2 from every SCC. The only freedom left is choosing whether edge X 4 or edge £5 is part of an 
arborescence. These choices are mutually independent, and as a result, there are 2 4 arborescences, which 
results in 208 explicitly written variables. The factorized representation of the Kirchhoff polynomial (based 
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on the SCC decomposition) is much shorter: 


°( a l a 2®4 + ai® 2 a 5 )(^ 1 ^ 2^4 + ^ 1 ^ 2 ^ 5 )( C 1 C 2 C 4 + C 4 c 2 c 5 )(d 4 (i 2 d 4 + diG^ds), 


which contains 25 explicitly written variables. We can further decompose each factor to finally obtain the 
prime factorization 

00402(04 + <15)6462(64 + 65)0402(04 + 05)^4 <^2(^4 + (65), 

containing only 17 explicitly written variables. 

An exhaustive application of the decomposition rules results in non-decomposable digraphs derived 
from subdigraphs of G, which correspond to prime factors of n(G). The prime decomposition provides a 
compressed form of n(G) which can be easily handled (evaluated) and manipulated (e.g., finding the greatest 
common divisor of two Ivirchhoff polynomials). Moreover, the prime factorization of n(G) retains important 
connectivity properties of the original digraph and provides information about the type of digraphs with 
practically enumerable arborescences, which is beneficial for various practical applications. Furthermore, the 
decomposition/prime factorization can serve as a preprocessing step for the existing iterative enumeration 
algorithms (such as those by Gabow and Myers [5], Uno 19 , or Kapoor and Rarnesh [8]), but can also be 
a starting point for a recursive algorithm, which employs the deletion-contraction step whenever further 
factorization is impossible. Finding the best edges (i.e., the edges that lead to the largest possible compression) 
on which the recursion shall proceed is an interesting open problem. For our experimental evaluation, we 
resort to several simple heuristics that can be of practical use, and that served well for demonstrating the 
potential of our method. 

The rest of the paper is organized as follows. Section [2] gives necessary terminology, definitions, and initial 
observations. Section[3]characterizes the non-decomposability, in terms of the structural properties of the input 
digraph, corresponding to primality of Kirchhoff polynomials. Section [4] lays out the prime decomposition 
rules. Section [5] discusses the applicability of the prime decomposition and compressed enumeration, while 
demonstrating the potential of our method by applying the recursive algorithm to real-world examples. 


2 Preliminaries 

We are dealing with simple directed graphs ( digraphs for short), however all our results also hold for digraphs 
with parallel edges (loops make no sense in our applications, since no arborescence can ever contain one). 
When necessary, we denote the vertices and the edges of a digraph G by V(G) and E(G), respectively. For 
V' C V, G\V'] denotes the induced subdigraph of G by the set of vertices V'. We denote the set of all incoming 
edges to vertex v by inc(u) and when it is not ambiguous, simply by in(u). 

A digraph G is strongly connected if for any two vertices u,v € V there is a directed path from u to 
v and from u to it. A strongly connected component (SCC for short) of G is any largest (w.r.t. vertex 
inclusion) strongly connected induced subdigraph of G. It follows that no two distinct SCCs can share a 
vertex, and, therefore, all strongly connected components G4,..., G& of a digraph G induce a unique partition 
V (Gi), ... ,V (Gk) of V. 

Observe that for two distinct SCCs Gi and Gj there can be a directed path from Gi to Gj, or a path 
from Gj to Gi, but not both. The existence of such paths between SCCs naturally induces a unique partial 
order on the SCCs G4, ..., Gk- 

Recall that an arborescence A is a rooted directed spanning tree of G with edges directed away from the 
root. Observe that in an arborescence A, the root is the unique vertex that can reach every other vertex 
along a directed path in A. We denote by arb(G) the set of all arborescences of G, and by arb„(G) the set 
of all arborescences rooted at vertex v. Let root„(G) be the digraph constructed from G by removing all 
edges incoming to v. All arborescences of root„(G) are then necessarily rooted at v. We say that G is rooted 
at vertex v, if v has no incoming edges, and every other vertex is reachable from v. Observe that every 
arborescence of a digraph rooted at a certain vertex v is itself also a digraph rooted at vertex v, so we find it 
justified to use the same term “rooted” when referring to digraphs and to arborescences. 
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We represent the set of all arborescences in a digraph G with a homogeneous multivariate polynomial 
over the variables e, e £ E, called the Kirchhoff polynomial (or synonymously, the arborescence enumerator ) 
k(G). For such a multivariate polynomial P , we denote by var(P) the set of variables appearing in P (in 
monomials with non-zero coefficients). 

Besides the Kirchhoff polynomial k(G) = X^Aearb(G) Y\eeE(A) e > we sometimes are only interested in arbores¬ 
cences rooted in a specific vertex v. Then we consider the related polynomial n v (G) = Y) ilearb , G] riee,E(/t) e fl 
A polynomial P is a factor of a polynomial Q , if there exists a polynomial R such that Q = P ■ R. 
polynomial P that has only trivial factors is called prime. Similarly, we say that G' is a component (a prime 
component) of G if k(G') is a factor (a prime factor) of n(G). 

In general, results presented in this paper hold even when considering multidigraphs instead of simple 
directed graphs. Through our digraph manipulations we could sometimes obtain multidigraphs (with loops) 
as a result. However, this does not present a problem, as we can transform the multidigraph into a simple 
digraph whose Kirchhoff polynomial is equal to that of the multidigraph: 

Observation 2.1. The following two operations on a multidigraph G preserve k(G): 

1. Removal of all loops, 

2. Replacing multiple parallel directed edges e\, ■ ■ ■, ei going from u to v with a single edge from u to v 

such that e = ei + e 2 + ... + e*. 

Observation 2.2. An arborescence of G exists iff the partial order of the SCCs has exactly one minimal 
element. Such a SCC is called the initial SCC. 

Definition 2.3. Let G[K'] be a SCC of G. By G\V'} we denote the digraph created from G\V'\ as follows 
(cf. Figure [7]b 

• IfG[V'] is the initial SCC , then G[W] = G[V']. 

• Otherwise, we create a new vertex v aux , and for every edge vu, such that u £ V' and v ^ V', we add an 
edge v 3UX u with label v aux u = vu. 

Definition 2.4 (Domination). If G is rooted at v, then we say that vertex u dominates vertex w, if all 
directed paths from v to w go through vertex u. By dom G (u) we denote the set of all vertices of G dominated 
by u. If dome(zt) 7 ^ {zi} and u v, we say that u is a non-trivial dominator. 

Definition 2.5 (Immediate Domination). Let G be rooted at v. Vertex y is called an immediate dominator 
of vertex z, if y 7 ^ z, y dominates z, and for every other vertex x that dominates z, we have that x also 
dominates y. Equivalently we say that z is immediately dominated by y, and we denote such y as parent G (z) 
(it is easy to see that there can be at most one such vertex). The set of all immediately dominated vertices by 
vertex y is denoted by imm G (?/) = {z : parent G (z) = y}. 

Thus, parent G (z) taken over all z £ V defines a directed tree T(G) of immediate domination, rooted at the 
root v of G (note that parent G (c) is undefined). Furthermore, for any vertex y , the immediately dominated 
vertices z £ imm G (?/) induce the following partition of dom G ( 2 /) \ {y}\ |dom G (z) : z £ imm G ( 2 /)}. 

Definition 2.6 (Contraction). For a subset S of the vertices of a digraph G, and for a vertex u £ S, by 
contracted digraph G(S —» u ) we denote the digraph G' constructed from G as follows: 

1. All edges xy, where x £V\S, y £ S\ {u} are removed from G. 

2. All edges within S are removed (i.e., all edges xy, where x,y £ S). 

3. All vertices of S are contracted into a single vertex u. 

1 Please, note that for a singleton G (that is V(G) = {?;}) the empty digraph is its only spanning arborescence, thus formally 
k(G) = 1. 
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Observe that G has no loops, and if G has no parallel edges incoming to u, then G , as well, has no 
parallel edges incoming to u. It can, however, happen that u has parallel outgoing edges. Further, for all 
z £ V \ S: 

• zu £ E{G) if and only if zu £ E(G), and 

• for every edge sz £ E{G) such that s £ S, there is a corresponding edge uz £ E{G). 


3 Primality of Components 


Observe that k(G ) is a special homogeneous polynomial: every monomial of k(G) contains exactly n — 1 
variables, each with exponent equal to one. Obviously, this property needs to also hold for factors of k(G) 
(recall that the monomials of k(G) represent an arborescence of G). Furthermore, observe that no variable e 
can appear in two factors of re(G). 

Proposition 3.1. If P is a factor of k.(G), then all monomials of P have the same number of variables, 
each with exponent equal to one. 

Proposition 3.2. If k(G) = P ■ Q, then var(P) 0 var(Q) = 0. 

We observe that the partitioning of edges into P or Q under factorization k(G) = P ■ Q is induced by a 
partitioning of vertices. 

Lemma 3.3. If k(G) = P ■ Q and v £ V, then either in(v) C var(P) or in(v ) C var(Q). 

Proof. Assume that there are two incoming edges to v , e\ and e-i, such that e\ £ var(P) and e 2 £ var(Q). 
Then there exists a monomial in k(G) (in its canonical form) containing both e\ and e-i. But such a monomial 
cannot represent an arborescence, a contradiction. □ 


Theorem 3.4. Let G be a strongly connected digraph. Then k(G) is prime. 

Proof. Assume, on the contrary, that n(G) = P -Q, and P and Q are nontrivial factors. Let V) and V 2 be the 


set of vertices with incoming edges in var(P) and var(Q), respectively. By Proposition 3.2 and Lemma 3.3 
V\ fl V 2 = 0, and since P and Q are nontrivial, b/0 and V 2 ^ 0. Let Vi,V 2 be arbitrarily picked vertices 
such that V\ £ V\ and V2 £ V 2 . 

Since G is strongly connected, for any v £ V there exists an arborescence of G rooted at v. Let A :i , A 2 be 
two arborescences rooted at V\ and V 2 , respectively. Let pi and P 2 be the monomials from P corresponding 
to the arborescences A\ and A 2 , respectively. In A±, every vertex from Vi, but the root ui, has exactly one 
incoming edge in A\. The label of every such edge necessarily belongs to P. Therefore, for A\, the degree of 
the monomial pi in P is deg(pi) = \Vi \ — 1. On the other hand, for A 2 , all vertices from V\ have an incoming 
edge whose label belongs to P, and therefore the degree of P 2 in P is deg(p2) = |Vi|- Then deg(pi) ^ deg(p2), 
which contradicts Proposition |3.1| □ 


Theorem 3.5. Let G be a digraph rooted at v, such that G\V{G) \ {v}] is strongly connected, and G has no 
non-trivial dominators. Then k(G) is prime. 

To prove the above theorem, we use the following notion, and prove further auxiliary lemmas. Given 
a directed simple cycle C C G, we say that a vertex u £ C is independent from C , if there exists a simple 
path V connecting root v to u, such that V and C are vertex-disjoint (except for u). We call any such V an 
independent path of u (with respect to C). 

Lemma 3.6. Let G be a digraph as in Theorem \3.5\ For any edge wu of G such that u,w 7^ v, there exists 
a simple directed cycle C containing wu, such that C has at least two independent vertices, and w is one of 
those. 


Lemma 3.7. Let G be a digraph as in Theorem 3.5 and let V\ and V 2 be an arbitrary (non-trivial) partition 
of the vertices V \ {a}. There exists a simple directed cycle having an independent vertex from V\ and an 
independent vertex from V 2 . 
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Figure 2: Example of a digraph (left) and its decomposition using strongly connected components (right). 
The edges in black are part of a strongly connected component and the edges in red are connecting different 
SCCs. 


4 Decomposition 

In this section we present two digraph decomposition rules corresponding to factorization steps of the Kirchhoff 
polynomial. The rules are based on the computation of SCCs and the dominator tree of a digraph. The 
exhaustive application of these rules yields digraphs that are prime factors of the Kirchhoff polynomial of the 
original digraph. 

Theorem 4.1. Let G[Vi], G[V 2 ],..., G[T4] be all strongly connected components of a connected digraph G. If 
G has exactly one initial component, then 


k(G) = k(G[Vi]) • k(G[F 2 ]) •... • «(G[Vfc]). 


(1) 


Figure [2] presents an example on how SCC decomposition is employed to factorise k(G). The presented 
decomposition uncovers a fundamental property of arborescences. Namely, it shows that the arborescences of 
a digraph G are in a one-to-one correspondence with all combinations of subdigraphs of G obtained following 
the procedure: i) Pick an arborescence from the initial SCC of G. ii) For every non-initial SCC of G, pick as 
set W an arbitrary (nonempty) subset of all vertices with incoming edges from outside of this SCC, and pick 
the spanning forest of this SCC rooted in W. 


An equivalent formulation (used in the proof of Theorem 4.11 is that any cycle can only be contained in a 


single SCC of G. This property allows us to relate the Kirchhoff polynomial of a digraph to the product of 
Kirchhoff polynomials of digraphs derived from its SCCs. 


Theorem 4.1 allows us to factorize k(G) of any connected digraph G with at least two SCCs. Yet, it is 
not guaranteed that the obtained factorization is prime or non-trivial. Consider the case when the initial 
SCC is composed of a single vertex v, then the theorem states that k(G) = k(G[{i>}]) • n(G\V \ {w}]). We 
can see that this is a trivial factorisation since k(G[{v}]) = 1 and k(G[V \ {e}]) = k(G) (here, G\V \ {u}] 
renames the vertex v to v aux but preserves the arborescences). In Theorem 3.4 we proved that the Kirchhoff 


polynomials of strongly connected digraphs are prime, which implies that the factor corresponding to the 
initial SCC is always prime. We note that the rest of the factors cannot be further non-trivially decomposed 
just using Theorem |4.1| Their primality is unsettled because they lack the property of strongly connected 
digraphs, namely, that any vertex of the digraph is the root of an arborescence, due to possessing a single 
root for all arborescences (the auxiliary vertex). Thereby, we proceed to studying the decomposability of 
non-initial SCC factors. 


With Theorem 4.2 we specify a decomposition step for non-initial SCC factors. More precisely, we provide 
an additional factorization rule of k(G) by using vertex domination relations (with respect to the root vertex 

Vaiix)- 
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Figure 3: Kirchhoff polynomial factorization of a digraph with respect to its dominator tree (above). The 
dominator tree of the example digraph (below). Color coding corresponds to immediate domination. 


Theorem 4.2. Let G be a digraph rooted at v and let u be an arbitrarily picked vertex of G. Denote 
D = dom(j(rt). Then 

k(G) = /<root u (G[D])) • k(G(D -► «)). (2) 


Similarly to SCC decomposition, one can interpret this result as either a one-to-one correspondence 
between arborescences of G and all combinations of the arborescences of its two factors, or as a statement on 
the structure of cyclic sets of edges in G (any cycle is either a cycle when restricted to D, or remains a cycle 
when D is contracted to u). 

Applying Theorem 4.2 to all domination relations defined by the dominator tree T(G) extends the 
decomposition to the whole dominator tree at once. An illustration of the dominator decomposition rule can 
be seen in Figure [3] 

Corollary 4.3. Let G be a digraph rooted at v. Then 


k(G) = JJ k u {H[{u,Ui, ... ,Ui}\) 

u£V 


( 3 ) 


where H = G(domG(ui) — > iq;...; domcj(uj) —>• uf) and {iq, ..., iq} = immg(M). 

Recall Theorem |3.5| which states that if there are no non-trivial domination relations in G (G being rooted 
at v for which no non-trivial decomposition by SCCs applies) then k(G) is prime. Corollary |4.3| ensures that 
all non-trivial dominator relations are eliminated but that cannot guarantee the primality of the decomposed 
factors since they can be SCC decomposable. Therefore, in order to obtain a complete prime factorization of 
the input digraph we need to use both rules of decomposition in an alternating fashion. One might expect 
that this process requires deep recursion. However, for any digraph a constant depth of recursion is needed. 
More precisely, it is enough to apply in sequence SCC factorization, dominators factorization, and SCC 
factorization to get prime components. This upper bound on the recursion depth is tight, as the example in 
Figure [4] shows. 

Theorem 4.4. If G is rooted at v, then any H being a factor obtained by the application of rule ([3]) has the 
following property: all factors of H obtained by applying ([Tj) are prime. 
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Figure 4: Example of a digraph that requires SCC decomposition followed by dominators decomposition and 
another SCC decomposition to obtain prime factorization. 


Pseudocode and time complexity. The pseudocode of the prime decomposition algorithm through 
SCCs and dominator relations can be found as Algorithm [l] in Appendix |7\] Observe that obtaining the SCC 
decomposition takes time 0(|P| + |i?|) by Tarjan’s strongly connected components algorithm 16 . Similarly, 
by the result of Alstrup et al. [3], one can find the dominator tree of a digraph in time 0(|V| + \E\) . 

Corollary 4.5. There is an algorithm that finds the decomposition of G into prime components G i,..., Gk 
in time ©(IKI + \E\). 

5 Applications 

Our decomposition technique has numerous uses, owing to its properties coming from the prime factorization 
of the Kirchhoff polynomial, preservation of structural properties, and compression. Here we illustrate the 
fundamental insight provided by the decomposition, namely that practical arborescence enumeration is not 
directly dependent on the exponentially growing number of arborescences but, rather, depends on digraph 
structure. We show that this structure can be uncovered in real-life networks by means of appropriate edge 
deletion-contraction. Further, we refer to several real-life applications that immediately benefit from the 
decomposition and the compact representation. 

Class of practically enumerable (PE) digraphs. For some digraphs, our decomposition algorithm 
results in a great speed-up compared to standard techniques, and for some digraphs it does not help much 
(just think of a complete digraph). Our factorization rules, however, hint at what the structure of a digraph 
needs to be, in order to allow an effective (practical) enumeration of arborescences. An immediate example 
are digraphs, whose Kirchhoff polynomials factorize exclusively to small (say, constant size) prime factors. 
These digraphs (and others) form the class of practically enumerable digraphs - the class PE for short. In 
this paper we do not aim to classify PE exactly. 

We note that the mechanics of the SCC decomposition (Theorem |4.1[ ) and domination decomposition 
(Theorem |4.2[ ) can be reversed to define a build-up procedure for generating non-trivial digraphs belonging to 
the class PE. For example, given two rooted digraphs P,G £ PE, we can obtain the digraph P o G in which all 
vertices of G are dominated by one particular vertex p from P and k(PoG) = k(P) ■ k(G). Additionally, G can 
be composed of multiple SCCs factorizing as n(G) = ft(-Pi) • ... • /c(P n ), where P\,...,P n £ PE. Interestingly, 
many digraphs obtained from real-life applications possess rich hierarchical and modular structure (4 
which facilitates the practical enumeration of their arborescences. 

Recursive enumeration. The classical edge deletion-contraction result allows to compute k(G) recursively 
even when further factorization is impossible: 

Observation 5.1 (Deletion-Contraction). Let G be a digraph and uv £ E be an arbitrarily chosen edge. 
Then 
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n(G) = k(G \ {uu}) + uv ■ k(G({u, v} —>■ u)). 





Table 1: Evaluation of the performance of the recursive algorithm on experimentally and randomly generated 
digraphs. Lengths of the compressed representation, |G(«(G))|, and of the expanded one, |«(G)|, are measured 
in the number of symbols (edge labels and arithmetic operators). 


G 

M 

\E\ 

|SGG| 

arb(G) 

KG)| 

|G(«(G))| 

|«(G)|/|G(«(G)| 

ColEl 

6 

10 

1 

26 

311 

63 

4.9 

Brain 

8 

28 

8 

5,040 

80,639 

55 

1466 

MDH 

9 

18 

1 

141 

2,537 

199 

12.7 

KNF33 

9 

24 

1 

1,728 

31,103 

2,673 

11.6 

PH05 

12 

35 

1 

53,376 

1,281,023 

19,435 

65.9 

GR 

13 

32 

1 

5,057 

131,481 

5,583 

23.5 

ERG 

17 

55 

1 

44, 741,862 

~ 1.5 • 10 9 

134,483 

11,311 

SF1 

18 

48 

1 

323,167 

11,634,011 

7,913 

1470 

SF3 

40 

105 

1 

2,598,830,145 

~ 2 • 10 11 

815,411 

254,971 

S838 

512 

819 

391 

~ 6.2 • 10 52 

~ 6.3 • 10 55 

1,515 

10 52 

PE45 

1,561 

3,125 

6 

2 10 • 3 775 

~ 2 21 • 3 775 

7,809 

10 372 


This helps us further exploit the prime decomposition by formulating a recursive algorithm alternating 
between prime factorization and edge deletion-contraction in every prime factor. It is an open problem which 
edges to delete-contract in order to obtain a maximally compressed Kirchhoff polynomial. However, various 
compression heuristics can be employed, e.g. by choosing i) an edge whose deletion produces the largest 
number of SCCs in the digraph (a strong bridge), ii) an edge whose deletion introduces the most non-trivial 
domination relationships among vertices, or iii) an edge whose contraction eliminates the greatest number of 
edges. 

We applied the recursive algorithm, using the heuristic of deletion-contraction of the edge that maximizes 
the number of SCCs, to a wide spectrum of real-life and randomly generated digraphs. Performance assessment 
of the recursive algorithm is shown in Table [TJ and detailed description of the digraphs can be found in 
Appendix [C] We obtain small running times and significant compression ratios ranging from 4.9, for a 
small biochemical network with 26 arborescences, to 10 372 , for a large randomly generated PE digraph with 
2 io . 3775 arborescences. The significant compression indicates that the investigated digraphs are PE digraphs 
in practice, and that the recursive algorithm performs well to uncover their structure. Our results further 
show that we can efficiently generate and handle Kirchhoff polynomials of considerable size originating from 
practically relevant digraphs. 


Steady state on digraphs governed by Laplacian dynamics. A prototypical task for which arbores- 
cence enumeration is essential is finding steady states on digraphs with Laplacian dynamics. Particular 
examples include the symbolic derivation of kinetic equations 14 and steady state expressions for biochemical 
systems |6 . However, the exponential growth of the number of arborescences with digraph size has limited 
practical analyses only to small digraphs. In Table [Tj we show that the recursive algorithm performs well and 
produces a compact output on various biochemical networks (described in Table [2] in the appendix) that are 
regarded tedious to study symbolically. 


Greatest common divisor of enumerators. The prime decomposition permits fast calculation of the 
greatest common divisor (GCD) of Kirchhoff polynomials, bypassing costly symbolic manipulations of 
multivariate polynomials. GCDs are required for simplifying ratios of Kirchhoff polynomials and defining 
equivalence classes on them. This is an important tool for the study of ratios of steady states on Laplacian 
systems and facilitates the extension of the notion of spanning edge betweenness 


17 to digraphs. 


Enumeration of specific arborescences. In many situations, one needs to enumerate/count only ar¬ 
borescences of certain type, e.g., those of minimum cost, or those of bounded degree. Our preliminary 
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experimental investigation suggests that our decomposition technique might be of practical/theoretical use. 

Random sampling. Finally, note that we can sample, uniformly at random, an arborescence from the 
compressed form of k(G) as follows: pick a random monomial from each of the (also recursively computed) 
components. 


6 Conclusions 

We studied the fundamental problem of enumerating arborescences via an explicit calculation of the Kirchhoff 
polynomial k(G) of a digraph G. We provided a graph-theoretic structural characterization of digraphs 
corresponding to prime factors of k(G). Based on this we presented a linear-time digraph decomposition 
technique that corresponds to the prime factorization of k(G). Combining the decomposition with the 
standard edge deletion-contraction recursion, we obtained a recursive algorithm that further compresses the 
representation of «(G). We tested the algorithm on digraphs from real-world applications, and obtained short 
running times and compact representations. 
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A Pseudocode 


Algorithm 1 Digraph decomposition corresponding to Kirchhoff polynomial prime factorization. 


1 

function SCCFactors(G) 


2 

Factors -s— [] 


3 

for all S € SCCs(G) do 

r> strongly connected components 

4 

H <- G[S] 

> induced component 

5 

if 3vu G E( G) : v ^ S A u € S then 

> non-initial SCC 

6 

H.ADDVERTEX(v aux ) 


7 

for all vu G E( G) : v ^ S A u £ S do 


8 

H.ADDEDGE(v aux u) 


9 

end for 


10 

end if 


11 

Factors. append(H) 


12 

end for 


13 

return Factors 


14 

end function 


15 



16 

function DominationFactors(G) 


17 

if -iIsRooted(G) then 

> G has to be rooted 

18 

return [G] 


19 

end if 


20 

T -e- DominatorsTree(G, getRoot(G)) 


21 

Factors -G- [] 


22 

H -s— G 

> copy of G 

23 

for u G postorder(T) do 


24 

S -e- T.successors(u) 


25 

Factors. append(MakeRooted(H[S U {u}], u)) 


26 

H.CONTRACTS, u) 

> contract all of S into u 

27 

end for 


28 

return Factors 


29 

end function 


30 



31 

function AllFactors(G) 


32 

Factors -G- [] 


33 

for all G1 G SCCFactors(G) do 


34 

for all G2 G DominationFactors(GI) do 


35 

for all G3 G SCCFactors(G 2) do 


36 

Factors. append(G3) 


37 

end for 


38 

end for 


39 

end for 


40 

return Factors 


41 

end function 
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B Omitted proofs 

B.l Proofs from Section [3] 

For a directed path V and vertices x,y £ V appearing on V in this order, we define a slice V[x : y] as the 
sub-path of P having 1 as a starting vertex and y as an ending vertex. Similarly, for a directed cycle C and 
u,v £ C, we define a slice C[u : u] as the directed path from u to v using the edges of the cycle C. 

Lemma 3.6. Let G be a digraph as in Theorem \3.5[ For any edge wu of G such that u,w ^ v, there exists 
a simple directed cycle C containing wu, such that C has at least two independent vertices, and w is one of 
those. 


Proof. Strong connectivity of G \ {i>} implies the existence of a simple directed path from u to w. This path 
and the edge wu form a directed cycle C containing edge wu. By Menger’s Theorem, there are two directed 
vertex-disjoint paths Vi,V 2 from the root v to vertex w. The paths are not necessarily disjoint with C. Let 
us denote, in reverse order on the cycle C , starting from w, all intersections ci, C 2 ,..., Ck of C with V\ or Vi. 
Assume, without loss of generality, that c k £V\- We construct the cycle C = V\[ck :w]UC'[w: c*,]. Observe 
that V 2 is vertex-disjoint with "Pi, and with C'[w : c*,], since all intersections with C are between Ck and w. 
Thus, V 2 is an independent path for w with respect to C. Similarly, V\ [v : Ck ] is an independent path for Ck 
with respect to C. □ 


Lemma 3.7. Let G be a digraph as in Theorem 3.5 and let V\ and V 2 be an arbitrary (non-trivial) partition 
of the vertices V \ {i>}. There exists a simple directed cycle having an independent vertex from Vi and an 
independent vertex from V 2 . 


Proof. Since G \ {i>} is strongly connected, there exists an edge wu with u £ V 1 and w £ V 2 . By Lemma 3.6 


there is a cycle C and a vertex x such that x and w are independent from C. Without loss of generality, we 
can assume that there are no independent vertices on C between w and x (as we can always pick x to be 
closer to w). If x £ Vj, C satisfies the statement of the Lemma and we are done. Assume therefore that 
x £ V 2 from now on. 

Let Vi and V 2 be the independent paths of w and x, respectively. Since w is not dominating it, there 
exists a v-u path V omitting w. We can assume that V intersects C , as otherwise V is an independent path for 
it, and the Lemma follows. Moreover, V must intersect C[x : w], as otherwise the first intersection z of V with 
C is an independent vertex (with V[v : z] its independent path) that lies between w and x , a contradiction. 

Let y be the last intersection of V with C[x : ir]. We show that after y, V does not intersect V\ nor V 2 . 
Assume for contradiction that V[y : u] intersects path V\ or V 2 , and let y' be the last such intersection. Let 
u be the first intersection of V[y' : it] with C. Observe that u always exists (it is, at the latest, the vertex 
it), and that, by our definition of y and y ', u £ C[w : x\ \ {x}. Let i £ {1, 2} be the index of the path such 
that y' £ Vi. Observe that Pi[v : y'] U V[y' : it] is an independent path from C and thus u is an independent 
vertex between w and x (note that it 7 ^ w), a contradiction. 

Let so = it, Si, s 2 , ■ ■ ■, Sk be all the intersection vertices of V[y : it] and C[w : x] \{ic, a;}. We now iteratively 
define the sequence Co, Ci, c 2 ,..., c 2 d as follows. We set cq = u and ci = Si. Then, for every £ = 1, 2,..., we 
do: If Sk £ C[w : c 2 i-i], then we set Cin. = Sk and terminate the creation of the sequence. Else, let Si be 
the value of c 2 i~i, and we set c 2 e = Sj, where j is the largest j > i such that Sj £ C[w : s*]. Note that it is 
possible that c 2 t = c 2 e-i■ We also set c 2 e+\ = Sj + 1. Observe that c 2 d = Sk- Now, one of the following two 
cases happen: 


cm £ V\‘. Consider a cycle C = C[c 2 d ■ y] U V[y : c 2 <j]. Observe that C[w : c 2 d] U V\ is an independent path 
for cidj and V 2 is an independent path for x (with respect to C), and the Lemma follows. 

c 2 i £ V 2 and c 2 i-2 £ Vi for some i: Consider a cycle C" = V[c2i-i : 02^-2] UC[c 2 i -2 : C2j-i]. Observe that 
C[w : c 2 i- 2] U Vi is an independent path for c 2l _ 2 , and 7 ^[c 2 i+i : c 2l ] UC[c 2l+ 2 : c 2i+ 1] U 'P[c 2 i + 3 : 
C2?:+2] U...U C[c 2 d ■ C2d-\] U V[y : c 2 d] U C[x : y] U V 2 is an independent path for c 2 i (with respect to 
C"), and the Lemma follows. 
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□ 


Theorem 3.5. Let G be a digraph rooted at v, such that G[V(G) \ {v}] is strongly connected, and G has no 
non-trivial dominators. Then n(G) is prime. 


Proof. As in the proof of Theorem 3.4 assume that n(G) = P ■ Q, and P and Q are nontrivial factors. Let 


V\ and V 2 be the set of vertices with incoming edges in var(P) and var(Q), respectively. Due to Lemma 3.3 
V\ and V 2 define a partition of V \ {u}. 

Let us choose C and two vertices x , y according to the statement of Lemma |3.7[ with V x being an 


independent path of a vertex x £ V\, and V v being an independent path of a vertex y £ V 2 . Also, let e x and 
e y be the edges from C ending at x and y , respectively. 

Thus, (C \ {e x }) U V x is a simple path from v to x and there exists an arborescence A x containing this 
path as a subdigraph. Respectively, there exists an arborescence A y with path (C\{e y })UV y as a subdigraph. 
Thus, the polynomial P has to contain a monomial corresponding to and A y [V\]. and the polynomial 

Q has to contain a monomial corresponding to and A^Vj]. Thus, there exists a monomial in the 

product of P and Q which corresponds to the set of edges A = A y [Vi] U A^V^]. However, CCA, and that 
contradicts with the definition of arborescence, since A has to be acyclic. □ 


B.2 Proofs from Section 0] 

We start with the following two observations that relate the unrooted enumerator n(G) and its rooted version 

k v (G): 

Observation B.l. 

k(G) = ^ n v (G) and n v (G) = n(root v (G)). 
vev 

The next observation comes from the fact that in order to construct an arborescence in a digraph, it is 
necessary and sufficient to pick exactly one incoming edge for each vertex except the root. 

Observation B.2. For an arbitrary G, v £ V(G), and {v±,v 2 , ■ ■ ■ ,r' n -i} = V(G) \ {u}: 

n v (G) = ^2 [{ei,..., e n -i} is acyclic] • ei • ... • e„_i, (4) 

e n _i ^in(v n — i) 

where [.] denotes the characteristic function. 

Theorem 4.1. Let G[Vi], G[V 2 ]) • • •, G[T4] be all strongly connected components of a connected digraph G. If 
G has exactly one initial component, then 

K (G) = K(G^])- K (G^ 2 ])-...- K (G^k]). ( 1 ) 

Proof. W.l.o.g. we can assume that G[Vi] is the only initial component of G. Thus k(G) = X^„eVi K v(G) 
and it is enough to show that for any v £ V \: 

K v (G) = K v (G^])-K(G^])-...- K (G^k]). ( 5 ) 

Let us set Vi = {v \, ..., v l n .} for i = 2,..., k and Vi = {v\,v \,..., u^ i _ 1 } U {w}. Additionally, we 

denote by Ui the auxiliary vertex of G[V] (for i = 2 ,..., k). Observe that for i = 2 ,..., k each G[V] is rooted 




at iii. Thus we can rewrite the right side of ® a €l 


^(G[V X ]) • k(G[V 2 }) • ... • k(G[V*]) = k v (G[V i]) • k U 2 (G[V 2 }) 

( \ 


Obs. El 


Obs. E2 


H>i, • ■ ■ > e ni-i} is acyclic] e^.j. 

ejeinpd), 

Ve^-ieinCui^i) j 

/ \ 


X! [{ e i> • ■ ■ > 4 2 } is acyclic] -e\ - 

efe in(vf), 

V4 2 ein’(< 2 ) 

( 

[{ e i’ •■■> e U is acyclic] • ej ■... ■ e: 

efein(t)f), 

Ve^Ginkj 


: (G[y fc ]) = l 


= ■■■ Y1 ([{ e i>-"’ e ni-i} is acyclic] [{d,...,e£j is acyclic]- 

e]ein(fj), e( Gin(v^), 

e* 1 _ 1 ein’(«* l _ 1 ) e* fc £in(i;* fc ) 

• (ei- • •••• (ef- (6) 

Since {v x , v 2 , - • = V( G )\M = (Vi \ M) U V 2 U ... U V k = {uj,..., v^ 1 _ 1 , • ■ •, «i,... both 

Q and ([6]) are summing over the same ranges (in a permuted order). Thus it is enough to prove that 

{ei,..., e„_i} is acyclic in G 

iff 

({e],..., e^ 1 _ 1 j is acyclic in G[Vi]) and ... and ({ef,..., e* } is acyclic in G[T4]), 

where {e},..., ..., {ej,..., e^ k } is the partitioning of {ei, e 2 , ..., e„_i} by the strongly connected 

component to which the target vertex belongs. However, it is enough to notice that any cycle in G can only 
span vertices from a single strongly connected component. O 

Theorem 4.2. Let G be a digraph rooted at v and let u be an arbitrarily picked vertex of G. Denote 
£> = dom(j(rt). Then 

k(G) = /s(root u (G[D])) • k(G(Z> -t «)). (2) 

Proof. Observe that © is equivalent to: 

k v (G) = k u (G[D]) ■ k v (G(D —► u )). 

Let us denote D \ {it} = {iq, v 2 ,..., iq_i}, and let V \ D \ {v} = ..., v n -i}. Thus, V = 

{v \,... ,Vi_i,u,v i+ x ,... jUn-ijW}. 

2 For any x £ Vi, G[Vi] preserves the labels of its incoming edges from the rest of G, thus we can safely write in(:r) without 
explicitly specifying the digraph. 


IV 









Observe, that in G there are no edges going from any vertex from V \ D to any vertex from D \ {it}, as 
otherwise it would not dominate said vertices. If we denote for short ini (it) = me (it) (all incoming edges to 
it) and m 2 (it) = hiQijj^ u \(u) (edges incoming from V\D), we have that m 2 (it) C ini (it). Thus: 

MG(D-M<))= E E [fe.-.®- 1 } is acyclic] • ... e n _i. 

ei6in 2 (u) e i+1 ein(j; i+ i), 
e ri _iGin(v Tl _i) 

Observe that any arborescence of G rooted at v cannot use an edge from ini(u) \ in 2 (it), as that would 
create disconnected digraph (as any path going from v to D has to go through it). Thus: 

mg)= E E E [{ei,... ,e„_i} is acyclic] •ei...e„_ 1 . 

ei€in(vi), eiGin2(u) ei+i€in(w i+ i) 
ei_iGin(vi_i) e n _i€in(w n _i) 


Additionally, in G[D\, all vertices from D \ {it} have the same incoming edges as in G (as taking induced 


subdigraph removes edges coming from outside, but there were no such edges). Thus by Observation B.2 


t (G[D]) = 53 [{ei,... ,ei_i} is acyclic] • ei •...• ej_i. 


eiGin(t>i), 


in 1 


Thus, it is enough to prove that, for e* £ in 2 (it), ei £ in(ui),..., ej_ 1 

v n -i): 

{ei,..., e n _i} is acyclic in G 


€ in(v i - 1 ),e i+1 £ in(i; i+ i),..., e n _i £ 


iff 


({ei,..., e^-i} is acyclic in root u (G{D])) and ({e*,..., e„_i} is acyclic in G(D —► it)) 

To prove it in one direction, observe that any cycle in G[D] remains cycle in G. Similarly, any cycle in 
G(D —> 11 ) remains a cycle in G. To prove it in another direction, observe that any cycle in G (with constrains 
on ei £ in 2 (it)) either spans vertices only from D \ { 11 } thus remains a cycle in G[D], or spans at least one 
vertex from G\(D \ {it}). In the latter case, the digraph remains cyclic when contracting whole D into it. □ 

Theorem 4.4. If G is rooted at v, then any H being a factor obtained by the application of rule ([ 3 ]) has the 
following property: all factors of H obtained by applying ([!]) are prime. 

Proof. Let H be a digraph obtained through the application of rule ([3]) , that is 

H = root u (G(dom G (ui) -¥ iti;... ;dom G (it i ) -> Ui)[{u, m, ... ,u»}]). 


First, observe that in digraph F = root^GIdomc^it)]) the domination relation between vertices (with 
respect to vertex it) is the same as is in G (with respect to vertex u). Thus, as we have that 


H = F(dom G (tti) ^ Hi;... 5 dom G (uj) ->■ m), 


there are no non-trivial dominance relations in H. 

What is now left to prove is that for any strongly connected component V :j ^ {it} of H , the digraph H[Vj] 
has no non-trivial dominators (with respect to u aux , its root). However, for any vertex w £ V) and for any 

path going from u to w in H, there is a path from i> aux to w in H[Vj\ corresponding to the suffix of the former 
one. Thus, if there is any vertex w' dominating (non-trivially) w in H[Vj], then w' dominates (non-trivially) 
iti in H, a contradiction. □ 
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C Experimentally evaluated digraphs 

Description and references of the experimentally evaluated digraphs in Section [5] are shown in Table [2] Note 
that in some application domains, including the Laplacian framework, rooted directed spanning trees for 
which all edges point towards the root, called in-arborescences, have to be enumerated. This is not limiting 
since there is a Injection between the in-arborescences of a digraph G and the (out-) arborescences of the 
transpose digraph of G. 


Table 2: Digraphs to which the recursive enumeration algorithm was applied. Shown are the digraph aliases, 
which type of arborescences were enumerated, and a short description for each digraph. 


Alias 

Enumerated 

Description 

ColEl 

in-arborescences 

Kinetic scheme of the ColEl replication control mechanism from 15 . 

Brain 

out-arborescences 

Brain effective connectivity model from 20 . 

MDH 

in-arborescences 

Proposed kinetic mechanism for the reaction cycle of M.methylotrophus 
methanol dehydrogenase (MDH) with ammonium as activator from 7 . 

KNF33 

in-arborescences 

Three by three classical KNF (Ivoshland, Nemethy, and Filmer) model 
for allosteric regulation of enzymes. 

PH05 

in-arborescences 

Regulation of yeast PH05 gene from 1 . 

GR 

in-arborescences 

Scheme for the catalytic mechanism of glutathione reductase (GR) 
from 13 . 

ERG 

in-arborescences 

Randomly generated Erdos-Renyi digraph with probability for edge 
creation p = 0 . 2 . 

SF1 

out-arborescences 

The largest SCC of a randomly generated scale-free digraph with 
n = 1000 , probability for connecting a new node to an existing one 
chosen randomly according to in-degree distribution a = 0.4, probability 
for adding an edge between two existing nodes /3 = 0.4, and probability 
for connecting a new node to an existing one chosen randomly according 
to out-degree distribution 7 = 0 . 2 . 

SF3 

out-arborescences 

As above, with n = 3000. 

S838 

in-arborescences 

An electronic benchmark circuit from 2|. 

PE45 

in-arborescences 

Randomly generated practically enumerable digraph from prime factors 
with topology {(1,3), (1,2), (2,3), (3,2)}, with recursion depth d — 4, 
and number of prime factors per recursion level n = 5. 
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